UNITED STATES PATENT APPLICATION 
for 

METHOD AND APPARATUS FOR MEMORY BANDWIDTH THERMAL BUDGETTING 

Inventors: 
David S. DeLorenzo 
Stephen W. Montgomery 
Warren R. Morrow 
Robin Steinbrecher 



Docket No. 042390. P 15056 

Prepared by: 
Paul E. Steiner 
Reg. No. 41 ,326 



Express Mail No.: EV325525617US 



042390.P15056 



METHOD AND APPARATUS FOR MEMORY BANDWIDTH THERMAL BUDGETTING 
FIELD OF THE INVENTION 

[0001] The invention relates to electronic systems, and more particularly to novel 
thermal management of devices, including memory devices. 

BACKGROUND AND RELATED ART 

[0002] With reference to Fig. 1 , an example electronic system 10 includes a 
processor 1 1 coupled to a memory 13 via a memory controller 15. For example, the 
electronic system 10 may be a general purpose computer system such as a desktop or 
laptop computer. Alternatively, the system 10 may be a special purpose system such 
as a digital audio player or personal digital assistant. 

[0003] Increasing clock speeds and transistor densities are causing temperature 
regulation challenges for high-performance microprocessors. In many cases, 
microprocessors are equipped with on-die or on-package temperature sensors to 
reduce the component activity level in response to elevated temperatures. For 
example, the clock speed may be reduced to limit the active switching power. 
Reducing the activity level is sometimes referred to as "throttling." 

[0004] Components in the memory subsystem face thermal management issues 
as well. Direct or indirect temperature sensing and feedback have been proposed for 
controlling memory temperature. For example, U.S. Patent Publication No. 
2002/0143488 describes a system which determines throttle settings using a thermal 
sensor. PCT Publication No. WO 00/1 1675 also describes controlling data transmission 
rates based on a temperature of a component, which in some examples includes a 
temperature sensor coupled to a memory module. However, the additional cost and 
complexity of a temperature sensor are less desirable for high-volume memory 
applications. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0005] Various features of the invention will be apparent from the following 
description of preferred embodiments as illustrated in the accompanying drawings, in 
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which like reference numerals generally refer to the same parts throughout the 

drawings. The drawings are not necessarily to scale, the emphasis instead being 

placed upon illustrating the principles of the invention. 

[0006] Fig. 1 is a block diagram of an electronic system. 

[0007] Fig. 2 is a flow diagram of a method for managing thermal characteristics 

of a memory device using an access budget. 

[0008] Fig. 3 is a flow diagram of a method for managing thermal characteristics 
of a device, according to some embodiments of the invention. 
[0009] Fig. 4 is a graph of an example memory access request load over time. 
[0010] Fig. 5 is a graph of access requests granted, according to some 
embodiments of the invention. 

[0011] Fig. 6 is an enlarged view of the graph of access requests from Fig. 5. 
[0012] Fig. 7 is a graph of calculated memory temperature, according to some 
embodiments of the invention. 

[0013] Fig. 8 is a graph of access request budgets versus time, according to 
some embodiments of the invention. 

[0014] Fig. 9 is a graph of total requests granted, according to some 
embodiments of the invention. 

[0015] Fig. 10 is another graph of access requests granted, according to some 
embodiments of the invention. 

[0016] Fig. 11 is an enlarged view of the graph of access requests from Fig. 10. 
[0017] Fig. 12 is another graph of calculated memory temperature, according to 
some embodiments of the invention. 

[0018] Fig. 13 is another graph of access request budgets versus time, according 
to some embodiments of the invention. 

[0019] Fig. 14 is another graph of total requests granted, according to some 
embodiments of the invention. 

[0020] Fig. 15 is a comparison graph of total requests granted, comparing some 
embodiments of the invention to another throttling technique. 
[0021] Fig. 16 is a comparison graph of calculated memory temperature, 
according to some embodiments of the invention. 
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[0022] Fig. 17 is another graph of an example memory access request load over 
time. 

[0023] Fig. 18 is a comparison graph of total requests granted, comparing some 
embodiments of the invention to another throttling technique. 
[0024] Fig. 19 is a comparison graph of calculated memory temperature, 
according to some embodiments of the invention. 

DESCRIPTION 

[0025] In the following description, for purposes of explanation and not limitation, 
specific details are set forth such as particular structures, architectures, interfaces, 
techniques, etc. in order to provide a thorough understanding of the various aspects of 
the invention. However, it will be apparent to those skilled in the art having the benefit 
of the present disclosure that the various aspects of the invention may be practiced in 
other examples that depart from these specific details. In certain instances, 
descriptions of well known devices, circuits, and methods are omitted so as not to 
obscure the description of the present invention with unnecessary detail. 
[0026] In general terms, a memory request access control algorithm, in the 
absence of direct on-package (or on-module) temperature measurement and feedback, 
utilizes an estimate to account for the cooling rate (e.g. temperature decay) of the 
memory devices based on environmental and package characteristics, and the heating 
rate due to memory access requests (e.g. reads and writes). In most systems, a goal is 
to permit a large number of granted memory access requests (as a metric of memory 
system performance), to maintain junction temperatures within desired limits, and to 
have a low computational impact from the thermal budgeting algorithm. 
[0027] With reference to Fig. 2, an example memory access request control 
method uses the following features: for every occurrence of an access request (block 
21), an access request counter is compared to a calculated budget (block 23); if the 
access request counter is within the calculated budget, then the access request is 
granted (block 27) and the access request counter is incremented (block 29); if the 
access request counter exceeds the calculated budget, then the memory access 
request is deferred (block 25). 
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[0028] 



For some embodiments of the present invention, a predicted value of 



memory device temperature is utilized to control an access request budget, as opposed 
to an access request counter. For example, an access request budget may be 
calculated at some large multiple of the access period, and requests are granted until 
the budget is exhausted. In appropriate circumstances, it is believed that electronic 
systems utilizing embodiments of the present invention provide higher memory 
bandwidths than alternate mechanisms while maintaining acceptable junction 
temperatures of memory devices. For example, it is believed that thermal budgeting 
according to some embodiments of the invention provides more realistic treatment of 
actual heating and cooling characteristics of the memory devices as compared to an 
access request counter. 

[0029] In general, physical objects reach thermal equilibrium with their 
surroundings along an exponential curve - i.e., the temperature response to a step- 
change in external conditions (Eq. 1): 



experimental, or analytical characterization, and depends upon various factors, 
including, for example, the thermal conductivity of the memory devices, the approach or 
device airflow velocity, DIMM pitch, etc. Expanding the term from Eq. 1 for equilibrium 
temperature (7"~ = T a + P * 6 ja ) provides a prediction of memory device temperature in 
accordance with physical characteristics of device operation (Eq. 2): 




^'oo 1 \ f / oo / w (Eq. 1) 

T corresponds to temperature as a function of time; 
Too corresponds to an equilibrium temperature; 
77 corresponds to an initial temperature; and 

a corresponds to a decay rate (e.g. a memory thermal time constant). 



Where: 



[0030] 



The value for a may be determined, for example, through numerical, 



-at 



(Eq. 2) 



Where: 



Ta corresponds to an ambient temperature; 
P corrresponds to a device power; and 
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Oja corresponds to a junction-to-ambient thermal resistance. 
With suitable modeling and testing to determine the various constants, the prediction of 
the memory device temperature may be relatively realistic as compared to the actual 
physical behavior of the memory device. 

[0031] According to some embodiments, various assumptions may be made for 
purposes of simplification. For example, the temperature prediction may not distinguish 
between active device power, P acftVe (related to memory read/write activity) and standby 
power, Pstandby (present whenever the memory subsystem is operating). A linear fit to 
Eq. 2 with an update period of A up date provides a memory temperature prediction in 
accordance with the following equation (Eq. 3). 

7 'request 



T n = T n _ x - 



T+P -6 

a max w ja 



•R 



count 



V 



JJ 



^update (Eq. 3) 



^update 

Where: Tn-1 corresponds to a previous temperature estimate; 
Pmax corresponds to a maximum memory power; 
frequest corresponds to an access request frequency (e.g. a clock rate); 
Aupdate corresponds to an estimator update period; and 
Rcount corresponds to a number of access requests granted. 



The term (1 7f request /A update * Rcount) corresponds to a percentage of time in each update 
period that memory access requests were granted. For example, the term corresponds 
to an activity factor. 

[0032] Simplification of Eq. 3 yields the following expression (Eq. 4). 

T n = T n _ x - [r n _! - (T a + c x • R count )]• c 2 (Eq . 4) 

Where: ci is (Pmax * Oja * 1 f /frequest / ' Aupdate); and 
C2 is (Aupdate *cc). 

This equation includes two multiply operations, one add, and two subtracts, and with 
appropriate choice of units, all operations can be integer operations. 
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[0033] With an updated knowledge of device temperature, the access request 
counter may be reset and the access request budget may be recalculated. In some 
embodiments, the access request budget may be determined by setting T n = T max in Eq. 
3, and solving for Rbudget (Eq. 5), which corresponds to R CO unt per update period (e.g., 
the number of access requests that will be granted each update period). 



^ _ ^ update '^ 



+ ' n-l 'a 



l budget 



^rnax ' ® ja ' request * Eq ' 5 ^ 



[0034] However, a detailed equation for Rbudget is not required. For example, the 
budget may only be imposed when full memory bandwidth is estimated to potentially 
cause memory junction temperatures to exceed a pre-determined temperature limit. 
For example, the temperature limit in this instance may be determined from Eq. 3 by 
setting T n = T max ar\d R C0U nt = 1 '// 'request /4 pe/ f 0d and solving for T n .-, (Eq. 6). Note that this 
equation yields a constant for a given ambient temperature. 



T l, m ft - 



T _fp .0 +T).A .(v 

'max \ max ja 1 a / ^update 



l-Vfe te -a (Eq ' 6) 



[0035] Then, in some embodiments, T n -i is compared to the temperature limit, 
Tiimit, if T n -i > Tiimit, then an access request budget may be imposed. For example, the 
imposed access request budget may be pre-calculated or calculated in accordance with 
Eq. 7. 
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T -T 

q max ' a 

''budget ' ~ ~ 



max ja / request 



[0036] For example, Rbudget may be determined by setting T n ^ = T max in Eq. 5 and 
solving for Rbudget, which is equal to R coun t per update period (e.g., the number of access 
requests that will be granted each update period). In some embodiments, a pre- 
calculated access request budget may be used for all operations. In some 
embodiments, an access request budget may be updated periodically in accordance 
with periodic updates of T a . In some embodiments, an access request budget may be 
newly calculated each time an access request budget is imposed. 
[0037] Advantageously, this equation also yields a constant for a given ambient 
temperature, and that simplification can further reduce the number of mathematical 
operations (Eq. 8): 



^budget ~ frmax )' C 3 



l budget y max 9 a ) ^3 (Eq. 8) 

Where: C3 is 1 /(Pmax * 0j a * Vf request) 



[0038] With reference to Fig. 3, an example method according to some 
embodiments of the invention is as follows. An access request occurs (block 31). A 
determination is made as to whether a predicted temperature is over a threshold (block 
33). For example, the predicted temperature is determined in accordance with the 
access request using one or more of the foregoing equations. If the predicted 
temperature is determined to not be over the threshold, the access request is granted 
(block 35). If the predicted temperature is determined to be over the threshold, an 
access request budget is determined (block 37). For example, the access request 
budget is determined using one or more of the foregoing equations. The access 
request is then processed in accordance with the access request budget (block 39). 
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For example, if the access request exceeds the determined access request budget, the 
access request may be deferred. 

[0039] Advantageously, because some embodiments of the invention do not 
impose an access request budget until a predicted temperature exceeds the threshold, 
access processing overhead is reduced. Moreover, with appropriately selected 
constants, some embodiments of the invention more accurately model the actual 
thermal behavior of the device, providing potentially higher bandwidths while 
maintaining junction temperatures within suitable limits. In addition, some embodiments 
of the invention provide relatively accurate modeling of device thermal behavior without 
the cost or complexity of physical sensors. 

[0040] With reference to Figs. 4-9, representative performance graphs are 
provided for a nominal, simulated system having the following characteristics: 

Tmax = 95° C (maximum sustained junction temperature); 

T a = 45° C (maximum ambient temperature, not updated); 

Pmax x 6ja = 100° C (temperature rise at full bandwidth); 

^request = 1 kHz; 

Aupdate = 1 .0 sec; and 
a = 0.05. 

[0041] In Fig. 4, a nominal memory access probability for a one hundred (100) 
second time simulation has a 20% probability from time zero to 25, a 75% probability 
from time 25 to 75, and a 40% probability thereafter. The foregoing characteristics are 
nominally selected for illustration purposes and do not correspond to a physical device. 
[0042] Figs. 5-6 show the number of access requests granted during each update 
period interval, with Fig. 6 showing an expanded time range. Fig. 7 shows a calculated 
device temperature. With reference to Fig. 7, the device starts at 95° C, cools during a 
period of low activity, increases during the period of sustained high bandwidth activity 
until it reaches the nominal maximum of 95° C, and finally resumes cooling when the 
activity decreases. Fig. 8 shows a calculated access request budget. For example, the 
access request budget is calculated using the full equation from Eq. 5. In this example, 
the sustainable bandwidth at T max corresponds to about a 50% activity level. Fig. 9 
shows an integrated count of total memory access requests granted over time. 
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[0043] Figs. 10-14 shows performance graphs for a simulated system having the 
same general characteristics as the system of Figs. 4-9. In the present system, the 
simplified equation from Eq. 6 is used to determine the calculated access request 
budget. Also, as can be seen in Fig. 13, no budget is imposed unless the calculated 
temperature is over a pre-determined threshold (in this example, 90° C). Because the 
simplified equation is used, more conservative timing for imposing the budget may be 
preferred to ensure that that T max is not exceeded. In comparison to the system of Figs. 
4-9, the budget is imposed a little earlier, with a small performance penalty of about 100 
access requests (e.g. using the full equation about 100 more requests are granted from 
time 37-38 in Fig. 6). 

[0044] In comparison with the method illustrated in connection with Fig. 2, 
simulations demonstrate that the method of Fig. 3 provides higher bandwidth for high 
access request rates. Performance may be similar for constant low bandwidth access 
requests, because throttling / budgeting is not a substantial factor for low access 
request rates. However, for prolonged high bandwidth requests and also for burst type 
requests, the method of Fig. 3 has better performance. 

[0045] With reference to Figs. 15, for the access request profile from Fig. 4, the 
comparison graph shows that the method of Fig. 3 (solid line) provides more total 
requests granted as compared to the method of Fig. 2 (dashed line). With reference to 
Fig. 16, the calculated temperature shows that the method of Fig. 2 (dashed line) may 
permit the device temperature to overshoot the desired maximum temperature. 
[0046] With reference to Fig. 17, another simulated access rate includes three 
burst-type accesses where the access request probability goes from 25% to 95% for a 
short duration. With reference to Fig. 18, the total number of requests granted over time 
shows how the method of Fig. 3 (solid line) grants more requests more quickly than the 
method of Fig. 2 (dashed line). Fig. 19 shows the calculated temperatures for both 
methods. 

[0047] The foregoing and other aspects of the invention are achieved individually 
and in combination. The invention should not be construed as requiring two or more of 
the such aspects unless expressly required by a particular claim. Moreover, while the 
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invention has been described in connection with what is presently considered to be the 
preferred examples, it is to be understood that the invention is not limited to the 
disclosed examples, but on the contrary, is intended to cover various modifications and 
equivalent arrangements included within the spirit and the scope of the invention. 
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